Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims: 

Claim 1 (original): A method comprising: 

receiving a first program unit in a parallel computing environment, the first program unit 
including a reduction operation associated with a set of variables; 

4 translating the first program unit into a second program unit, the second program unit to 

associate the reduction operation with a set of one or more instructions operative to partition the 
reduction operation between a plurality of threads including at least two threads; and 

translating the first program unit into a third program unit, the third program unit to associate 
the reduction operation with a set of one or more instructions operative to perform an algebraic 
v operation on the variables. 

Claim 2 (original): The method of claim 1 further comprising encapsulating the reduction 
operation with the instructions associated with the third program unit. 

Claim 3 (original): The method of claim 1 further comprising reducing the variables 
logarithmically. 

Claim 4 (original): The method of claim 1 further comprising translating the first 
program unit into the second program unit utilizing, in part, a source-code to source-code translator. 

Claim 5 (original): The method of claim 1 further comprising translating the first 
program unit into the third program unit utilizing, in part, a source-code to source-code translator. 

Claim 6 (original): The method of claim 1 further comprising associating the plurality of 
threads each with a unique portion of the set of variables. 

Claim 7 (original): The method of claim 6 further comprising combining, in part, the 
variables associated with the plurality of threads in a pair-wise reduction operation. 

Claim 8 (currently amended): An apparatus comprising: 

a memory including a shared memory location; 

a translation unit coupled with the memory, the translation unit to translate a first program 
unit including a reduction operation associated with a set of at least two variables into a second 



2 



program unit, the second program unit to associate the reduction operation with one or more 
instructions operative to partition the reduction operation between a plurality of threads including at 
least two threads , the translation unit to also translate the first program unit into a third program unit, 
the third program unit to associate the reduction operation with a set of one or more instructions 
operative to perform an algebraic operation on the variables ; 

a compiler unit coupled with the translation unit and the [[shared-]]memory, the compiler 
unit to compile the second program unit and the third program unit : and 

; > a linker unit coupled with the compiler unit and the [[shared-]] memory, the linker unit to 

link the compiled second program unit and the compiled third program unit with a library. 

Claim 9 (currently amended): The apparatus of claim 8 wherein the second program unit is 
to associate associat e s a set of one or more instructions with the reduction o p e rativ e operation to 
encapsulate the reduction operation. 

Claim 10 (original): The apparatus of claim 8 wherein the variables in the set of variables 
are each uniquely associated with the plurality of threads and the library includes instructions 
operative to combine, in part, the variables associated with the plurality of threads. 

Claim 1 1 (original): The apparatus of claim 10 wherein the library includes instructions 
operative to combine, in part, the variables in a pair-wise reduction. 

Claim 12 (original): The apparatus of claim 8 further comprising a set of one or more 
processors to host the plurality of threads, the plurality of threads to execute instructions associated 
with the second program unit. 

Claim 13 (currently amended): The apparatus of claim 8 wherein the second program unit 
includes a callback routine and the callback routine is associated with instructions operative to 
perform an algebraic operation on at least two variables in the set of variables. 

Claim 14 (original): The apparatus of claim 13 wherein the library is operative to call the 
callback routine to perform, in part, a reduction on at least two variables in the set of variables. 

Claim 15 (original): A machine-readable medium that provides instructions, that when 
executed by a set of one or more processors, enable the set of processors to perform operations 
comprising: 
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receiving a first program unit in a parallel computing environment, the first program unit 
including a reduction operation associated with a set of variables; 

translating the first program unit into a second program unit, the second program unit to 
associate the reduction operation with a set of one or more instructions operative to partition the 
reduction operation between a plurality of threads including at least two threads; and 

translating the first program unit into a third program unit, the third program unit to associate 
the reduction operation with a set of one or more instructions operative to perform an algebraic 
operation on the variables. 

Claim 16 (currently amended): The machine-readable medium of claim 15 further comprising 
instructions for encapsulating the reduction operation with a set of one or more instructions. 

Claim 17 (currently amended): The machine-readable medium of claim 15 further comprising 
instructions for translating the first program unit into the second program unit utilizing, in part, a 
source-code to source-code translator. 

Claim 18 (currently amended): The machine-readable medium of claim 15 further comprising 
instructions for reducing the variables, in part, logarithmically. 

Claim 19 (currently amended): The machine-readable medium of claim 15 further comprising 
instructions for translating the first program unit into the third program unit utilizing, in part, a 
source-code to source-code translator. 

Claim 20 (currently amended): The machine-readable medium of claim 15 further comprising 
wherein the instructions cause the second program unit utilizin g to utilize , in part, the third program 
unit to perform a reduction operation on the set of variables. 
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